IN THE SPECIFICATION 

Please amend the paragraph beginning on page 3, line 10, as follows: 
In another aspect of the invention, said calculating P(X 0 ... X n | So . . . S n ) comprises 
assigning a state label Si to each i* byte X; of the byte sequence so as to maximize the 
equation: 

P(Xo...X N |S„...S N ) = Po(So)W 3KIS " , B 5wl "] 

wherein Po(S 0 ) is the initial distribution of states; ^i-^is a "state-to-state" transmission 
transition matrix; and |5 <> is a "byte-from-state" matrix of the probabilities of 
generating a byte value Xj given a state Si. 

Please amend the paragraph beginning on page 4, line 7, as follows: 
In another aspect of the invention, 
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where As(Xj) are histogram functions of the a states and [[Sj]] £j(X0 are probabilities of 
associating noise with the noise state for bytes within r+7 ranges of byte values Xj. 

Please amend the paragraph beginning on page 5, line 1, as follows: 
In another aspect of the invention, 



2 



h A (X,=\) 0 



h A (X, = \27) 0 0 

6,(^=128) 0 0 



e,(*, =160) 0 0 

e 2 (JT, =161) h } {X, = 161) A 2 (A>161) 



e 2 (X,=254) />,(*, =254) A a (Jf l =254) 
e 3 (*, = 255) 0 0 



where h s (X{) are histogram functions of the states, and [[&j]] 8](Xi) are probabilities of 
associating noise with the ASCII state within a plurality of Xi ranges for a three-state byte 



sequence. 



Please amend the paragraph beginning on page 5, line 6, as follows: 

Another aspect of the invention further comprises, providing a valid three-state byte 
sequence having an ASCII state and comprising valid ASCII and two-byte characters, 
computing an ASCII histogram h A (X0 by a method comprising sampling valid ASCII text so 
as to measure the frequency of occurrence of each byte value; computing an unnormalized 
histogram of said sampling over the ASCII range of Xj; and normalizing said unnormalized 
histogram such that the entire column of the matrix containing said ASCII histogram said 
sums to 1, and computing a first-byte histogram h x (X{) by sampling valid two-byte text and 
computing the unnormalized first-byte histogram over the odd bytes, and normalizing said 
first-byte histogram such that the entire column of the matrix containing said first-byte 
histogram sums to 1 , and computing a second-byte histogram A 2 (Xi) by sampling valid two- 
byte text and computing the unnormalized second-byte histogram over the odd bytes, and 
normalizing said second-byte histogram such that the entire column of the matrix containing 
said second-byte histogram sums to 1 . 
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Please amend the paragraph beginning on page 6, line 18, as follows: 

Disclosed is a method of validating a byte sequence having a plurality of states 
including an ASCII state, the method comprising selecting the ASCII state as the noise state 
generating a most probable state sequence for the byte sequence by a method comprising: 
calculating P(X 0 ... X„ | So . . . S n ), representing the conditional probabilities of said byte 
sequence given a state sequence, wherein said calculating P(Xo ... X„ | So . . . S n ) comprises 
assigning a state label Si to each i ih byte Xj of the byte sequence so as to maximize the 
equation: 



wherein Po(S 0 ) is the initial distribution of states; ^i^is a "state-to-state" transmission 
transition matrix; and I( *< |5 '> is a "byte-from-state" matrix of the probabilities of generating a 
byte value Xi given a state Si; and utilizing said state sequence to identify all noise in the byte 
sequence, localizing said noise in said noise states, and deleting said noise from the byte 
sequence. 

Please amend the paragraph beginning on page 8, line 1, as follows: 

Figure 2 depicts a typical Markov model for allowable state sequences for mixed 
double-byte and ASCII sequences, such as GB-type byte sequences. The state of a byte in 
this example can be one of three, namely an ASCII character (state A), a first byte of a two- 
byte character state (state GB1), or a second byte of a two-byte character state (state GB2). 
The states are designated by the user, dependent upon the noise he wishes to detect, which in 
this example is invalid character codes. As can be seen in the diagram, a single-byte ASCII 
character (state A) can be followed by another ASCII character (state A) or by the first byte 
(state GB1) of a double-byte GB-type character, but never can an ASCII character (state A) 
be followed by the second byte (state GB2) of a double-byte GB-type character. This is 
shown by the directions of the arrows leading toward and away from the ASCII state A. 
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Likewise, a first GB byte (state GB1) may be followed by a second byte (state GB2), but 
never by an ASCII character (state A); and a second byte (state GB2) may be followed by an 
ASCII character (state A), or by a first byte (state GB1), but never by a fest second byte 
(state GB4-2). A violation of these rules is not permitted in the state string generated by the 
state sequence labeler 100 of Figure 1 and this is mathematically guaranteed by the zero 
entries in the "state-to-state" transmission transition matrix -* (S < 1 s «> of Equation 4c, more fully 
described below. 

Please amend the paragraph beginning on page 9, line 13, as follows: 
We may then model the conditional probabilities of the byte sequence of Equation 1 
given the state sequence of Equation 2 as: 

P(X 0 ... X N | So . . . S N ) - Po(So) L '- 1 JL '-° J (3) 

where P 0 (S 0 ) is the initial distribution of states, namely P 0 (A) - P 0 (l) = P 0 (2) = 1/3; ^ s < i s «>is 
the state-to-state transmission transition matrix and will have the properties of the Markov 
model being utilized; B{X > 1 5 <) is the "byte-from-state" matrix of the probabilities of generating 
a byte value Xi given a state Si. 



5 



